Attorney Docket No. 2959- 104P 

Detection of Unlabeled Hybridized DNA and RNA 
Using Restriction Enzyme Digestion 

Technical Field 

This invention is related to the fields of deoxyribonucleic acid (DNA) and ribonucleic 
acid (RNA) sequencing; and genetic mapping, characterization, and diagnostics; and detection 
and identification of organisms, viruses, and nucleic acids. 

Background of the Invention 

DNA and RNA - The nuclei of living cells possess chromosomes, which contain the 
genetic information necessary for growth, regeneration and other functions of organisms. 
Instructions concerning such functions are contained in the molecules of deoxyribonucleic acid 
(DNA). The genetic information of DNA is contained in the sequence of nucleotide bases, 
which are arranged on a linear polymer of deoxyribose phosphates. The four bases are thymine 
(T), adenine (A), cytosine (C), and guanine (G). Two strands of DNA form a double helix joined 
by base pairing of Ts to As and Cs to Gs. Accordingly, the base sequence along one strand 
determines the order of bases along the complementary strand. Cells use DNA as a template for 
RNA synthesis. During RNA synthesis, a strand of RNA complementary to the template DNA is 
formed. RNA/DNA hybrids obey the same base pairing rules as DNA/DNA hybrids except that 
RNA contains uracil (U) instead of T. Both RNA and DNA, therefore, can be used to derive 
genetic information. 

DNA and RNA Sequencing - Sequencing is the determination of the sequence of bases in 
nucleic acid strands. Polymerase chain reaction (PGR) is a process in which segments of sample 
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DNA that are of interest are synthesized repeatedly until an amount is produced which is suitable 
for further experimentation. For the purpose of sequencing, PGR can contain fluorescently 
labeled, dideoxynucleotide triphosphates that stop further elongation of strands. Each of the four 
kinds of dideoxynucleotide triphosphates (A, T, C, or G) is labeled with a different fluorescent 
5 label. The products of this reaction are of various lengths and each strand has only one label that 
represents the last base added. When separated by size, the base sequence of the template DNA 
can be determined. This is the Sanger method of DNA sequencing and is, at present, the most 
commonly used method for sequencing long DNA sequences. For RNA, PGR includes reverse 
transcriptase that makes a complementary DNA from the RNA template. The DNA products are 

10 then sequenced by the Sanger method. The Sanger method requires a high quality gel and 
electrophoresis of the ampified DNA. It is a time consuming process and not well-suited for 
parallel sequencing of several sequences. 

Sequencing by Hybridization — Sequencing by hybridization (SBH) uses the specificity of 
DNA/DNA binding rules and melting temperature to sequence labeled sample DNA. 

15 Hybridization is performed at elevated temperatures close to the melting point of the fully 
complementary hybrid so that non-complementary DNA cannot hybridize to the probe. Also, 
imperfect hybridization can be followed by washes approaching the melting temperature to 
ensure that non-complementary sample DNA has been removed from the probe site. Typically, 
probes are immobilized on a solid surface. Detection of labeled DNA at the location of a probe 

20 indicates the presence of complementary sequence in the sample DNA. The solid surface 
containing the probe oligonucleotides is sometimes referred to as an SBH chip, a genosensor 
chip, a hybridization surface, or a dot blot. This version of the SBH process is referred to as 
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Format II SBH. E. Southern disclosed Format II SBH in International Application No. 
PCT/GB89/00460. 

Format I SBH is an alternative method where sample DNA is attached to a solid surface, 
such as a nylon membrane, and hybridized with labeled probes. R. Drmanac and R. 
5 Czerkvenj akov disclosed Format I SBH in US Patent No. 5 ,202,23 1 . 

Genetic diagnostic information can also be obtained using techniques similar to those 
used for obtaining sequencing information. In diagnostic applications, a known sequence that 
indicates the presence of a particular gene or organism is immobilized to the surface and labeled 
DNA from the sample is allowed to hybridize. After washing to eliminate nonspecific binding, 
1 0 detection of the label indicates that gene or organism is present in the sample. 

While the above sequencing by hybridization techniques are capable of producing 
reliable results, if properly applied, certain disadvantages are inherent. For example, the target 
DNA must be sorted and immobilized (Format I) or labeled (Format II). These procedures for 
sequencing are laborious and require a large DNA or RNA sample. If these limitations can be 

15 overcome, the parallel nature of the genosensor chip makes it ideal for several forms of 
diagnostics, forensics, and other applications. In particular, if amplification, labeling, and 
segregation of the target DNA before hybridization were not necessary, the genosensor chip 
could be used more readily as a routine diagnostic test. DNA arrays utilize parallel hybridization 
of target DNA to multiple known-sequence probes in a compact array. This allows detection of 

20 multiple sequences using SBH with only one small DNA sample [1-8]. Since the arrays are so 
compact (e.g., 100s to 1000s on a silicon chip), the reliability of diagnostic tests can be improved 
by using multiple probes. 



3 



Attorney Docket No. 2959-1 04P 

Probe Technology - Molecular beacons (described in US Patents 5,118,801 and 
5,312,728) are single-stranded DNA (ss-DNA) probes that form a stem-and-loop structure. The 
loop sequence is complementary to a sequence of interest. Binding of complementary DNA to 
the loop sequence opens the stem-and-loop structure. If no complementary DNA is present, the 
5 stem-and-loop structure remains intact. Molecular beacons do not require labeling of the sample 
DNA because the beacons have a single fluorophore attached to one end and a quencher to the 
other end. When in a stem-and-loop conformation, the ends are close together and light emission 
is quenched. When hybridization breaks the stem structure, the fluorophore and quencher are 
separated and light is emitted [91- Although the stem is only 6 or 7 base pairs in length, it has a 
10 melting temperature well above 55°C due to its intramolecular nature. Molecular beacons are 
routinely used to monitor PGR reactions in real time. 

Molecular beacons can be used to differentiate sequences that differ by only a single base 
such as alleles for drug-resistant tuberculosis, himian coagulation factor V, human estrogen 
receptor, and human methylenetetrahydrofolate reductase [10-13]. This specificity is not 
15 available with most other probe types. In addition to DNA studies, molecular beacons have been 
used to detect RNA sequences in living cells [14-16]. It has been demonstrated that molecular 
beacons still function when immobilized to sinfaces [17-18]. ; 

It is necessary that molecular beacons form the desired stem-and-loop structure. One 
useful tool is a DNA folding program available on the internet 
20 (http://bioinfo.math.rpi.edu/-zukerm/) that estimates the free energy of formation of the stem 
hybrid and predicts the melting temperature. In general, the loop areas should be approximately 
21 bases long with a GC content of 25-75 percent. The loop length can be adjusted to 
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accommodate various GC percentages. The stem structures should be 6 or 7 base pairs long and 
contain mainly Cs and Gs. 

Although molecular beacons have many good qualities, they are limited for use in 
applications other than PGR diagnostics. The use of a single fluorophore label requires the 
5 amplification of the sample DNA. Because many other compounds fluoresce and produce a high 
background, a large amoxmt of fluorophore must be present to be measured accurately. 

Label Technology - Bioluminescent enzymes, such as luciferase, alkaline phosphatase, 
horseradish peroxidase, p-galactosidase, and aequorin, have recently been used to detect 
attomolar concentrations of molecules and have been shown to be 30-60 fold more sensitive 

10 than radioactive labels [19]. They produce much less backgroimd since no excitation light is 
required. In these particular cases, light is produced continuously as long as activator is 
available, therefore, photons can be counted over a period of time. Label technology is a 
rapidly advancing field, and, for this reason, a type of probe that is easily adaptable to new . 
labels or multiple labels would be highly beneficial. 

1 5 Restriction enzymes - Restriction enzymes (or restriction endonucleases) are produced in 

bacteria, presumably to degrade foreign DNA. Methylation differences between the bacterium's 
genomic DNA and the foreign DNA protect the genomic DNA from cleavage [20]. 

Restriction enzymes bind at recognition sequences. Recognition sequences are typically 4 
to 6 bases long, but may be longer. Restriction enzymes cleave double-stranded DNA (dsDNA) 

20 at a restriction site, which may or may not be located within the recognition sequence. At each 
restriction site, one phosphodiester bond from each of the strands in the dsDNA is hydrolyzed to 
form hydroxyl and phosphate groups. The cleaved sites, one on each DNA strand, may be 
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opposite each other forming two blunt-ended dsDNA fragments, or may occur at different 
locations resulting in fragments with protruding unpaired bases called sticky ends [21]. 

Currently, almost 3000 restriction enzymes with over 200 different recognition sequences 
are known. The field of molecular cloning relies heavily on the use of restriction enzymes to 
5 create new chimeric DNA sequences from different sources [22]. This demand is met by several 
companies (New England Biolabs, Fermentas, Roche, and Stratagene, for example) that offer 
restriction enzymes and study the activities of the enzymes. Restriction enzymes are usually 
supplied with a concentrated buffer solution that optimizes activity. Typical buffer conditions are 
pHs from 7.0 to 7.9, magnesium salts from 10 to 100 mM, bovine serum albumin from 0 to 100 

10 M-g/ml, and temperatures from 30 to 65°C. One unit of restriction enzyme is defined as the 
amount necessary to digest 1 jixg of DNA in one hour under optimal conditions. 

The most commonly used restriction enzymes are highly efficient (nearly 100% of 
restriction sites cut). DNA fragment length can affect restriction enzyme activity. Frequently, 
restriction enzymes can cut short pieces of dsDNA efficiently, but some require longer 

15 fragments. Suppliers (New England Biolabs, for example) recommend having six base pairs on 
each side of recognition sites to ensure efficient cleavage when length preference is unknovra. 
Some enzymes, such as Eco RI, are known to cut efficiently with only one base on each side of 
the recognition site. 

20 Summary of Invention 

The present invention utilizes a nucleic acid probe for detecting specific target nucleic 
acid sequences. The probe is an oligonucleotide containing a covalent surface-coupling group ^ 
attached to one end and a label attached to the other end, wherein the oligonucleotide contains 



6 



Attorney Docket No. 2959- 104P 

two complementary regions which hybridize to each other in the absence of target nucleic acid 
forming a loop region which contains a sequence complementary to the target sequence being 
probed, and forming a stem region which contains a restriction enzyme site which is not 
present in the hybridized loop. 
5 The present invention further utilizes a method of determining the presence of one or 

more specific target nucleic acid sequences using such a probe. 

Brief Description of the Drawings 

Figure 1. The structure of a grobe with a removable label (PRL) attached to a glass 
J'JIO surface. G, A, and T represent the bases of nucleotides in an ssDNA molecule. The looped 
ft:, area contains the target-hybridization sequence. Stem-forming sequences, located on each side of 
ri the target-hybridization sequence, hydrogen bond to form the stem. This stem can be cut by the 

restriction enzyme Pal I. 

5 Figure 2. A summary of the processes in a preferred embodiment 

Detailed Description of the Invention 

The invention is a probe with a removable label (PRL) that can be used to detect 
hybridization without labeling target DNA. The PRL structure differs from that of a molecular 
20 beacon in three ways: 1) the stem structure contains a restriction enzyme recognition site, 2) 
different label types, including fluorescent labels, can be used and no quencher is necessary, 3) a 
moiety is present to attach the PRL to surfaces to form arrays (Figure 1). Like molecular 
beacons, the stem structure in the PRL must open upon hybridization of the loop sequence to 
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complementary target DNA or RNA. The stem structures may be approximately 4-9 nucleotides 
in length. For each PRL array, a restriction enzyme is used that has a restriction site in each stem 
sequence but does not have a restriction site or a recognition site in any of the hybridized loop 
sequences. A list of several potential restriction enzymes and their recognition sequences is given 
5 in Table 1 . Identification of additional suitable restriction enzymes can be readily determined 
based on the present disclosure of the invention and generally available knowledge in the field of 
molecular biology. For example, the PRL shown in Figure 1 has both the Pal I recognition site 
and restriction site located in the stem. The stem sequence GGCCAG, written 5' to 3', can be cut 
by Pal I forming the fragments GO and CCAG, After this fragmentation, covalent bonds no 

1 0 longer connect the label to the surface. The six weaker hydrogen bonds between the two CG base 
pairs are not strong enough to hold the label to the boimd portion of the PRL and the label can be 
washed away. Other stem sequences, used with other restriction enzymes in Table 1 , may have 
more than six hydrogen bonds remaining. Washes at elevated temperatures can be used to break 
these hydrogen bonds without damaging the covalent bonds. The conditions used for washing 

15 away the labeled nucleotide fragment subsequent to restriction enzyme digestion may be 
routinely determined by one skilled in the art based on the restriction site and restriction enzyme 
used. Such information is typically available from the company source of the restriction enzyme. 

The loop portion of the PRL should be about 16 to 25 nucleotides in length. In order to 
function properly, the hybrid formed upon hybridization of a fully complementary DNA to the 

20 loop sequence must be more energetically favorable than the stem hybrid, however, any loop 
hybrid containing mismatches must be energetically less favorable than the stem hybrid. In order 
to achieve this, the length or composition of the loop and /or stem sequences is varied. 
Commonly, a loop sequence with a high AT content requires more nucleotides in the loop 
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sequence or a weaker stem sequence. A loop sequence with a higher GC content may need a 
reduced length or a more stable stem sequence. 
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Table 1. A list of restriction enzymes and their recognition sequences. Abbreviations are as 
follows: indicates the cleavage site; indicates the second cleavage site in "sticky end" 
cleavages; N, any of the four bases; W, base A or T; M, base A or C; K, base G or T ; Y, 
base C or T; R, base G or A; S base G or C; D base A or G or T; H base A or C or T. Table 
adapted from the Restriction Enzyme Database (http://rebase.neb.com), Dr. Richard J. 
Roberts and Dana Macelis, authors. 



Enzymes 


Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 


Alul 


AG-^CT 


MM 


Baa 


TGG^CA ^ 


MM, M/w3 11, MwNI, M^cl 


BsaAl 


YAC^TR 1 


^^rBAI, MspYl, PsuAl 


BsrBl 


CCG^CTC 


AccBSl, BstDlOll, Bst3 INI, Mbil 


Btrl 


CAC^GTC 




CacSI 


GCN^NGC 




CviJI 
CvlRl 


RG^CY 


CviTI 


TG^A 


HpyCmY 


Eco47m 


AGC^CT 


Afel, Aitl, AorSlUl, Funl 


Ecoin 1 GGCX}CC 


Egel, Ehel, Sfol 


EcolCKL 


GAG^CTC 


Ecineil, Eco53kl, Mxal 


FnuDll 




Accll, BceBl, Bepl, Bpu95l, Bsh\236\, BspSOl, Bspl23l, 
BstFNl, Bstm, Bsul532l, Btkl, Csp6SKyi, CspKYl, FauBll, 
Mvnl, Thai 


Hael 




- 


Haelll 


GG^C 


Bee All, Biml9ll, Bme36\l, BseQl, Bshl, BshFl, Bspllll 
BspBKL, BspKl, BspRl, &wRI, Btel, Cltl, Dsall, FnuDl, 
MchAll, MfoAl, NgoFll, NspLKl, PaK, Pdel33l, P/7KI, Plal 
Sbvl, Sfal, Sual 


Hindu 


GTY^RAC 


HmJCl Hinoll 


HpySl 


GTN^NAC 


HpyBll 


Lpnl 


RGC^CY 


BmeUll 


Mstl 


TGC^CA 


Acciei, Aosl, Avill, Fdill, Fspl, Nsbl, Paml 


Nael 


GCC^GC 


Cco\ Pdil 5awBMKI, SauHPl SauL?l Saum. SauSl 
Slulllll 


NlalV 


GGN^NCC 


Aspm, BscBl BspLl Pspmi 


Nrul 


TCG^GA 


BspS^l MIuB2l Sbol3l Spol 


NspBll 


CMG^KG 


Msp All 


PmaCl 


CAC^TG 


Bbr?l BcoAl Ecolll Pmil 



10 



Attorney Docket No. 2959- 104P 



Enzymes 


Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 


PvuU 


CAG^TG 


BavL, Bav Al, BavBl, Bspl53Al, BspM39l, BspO^l, Cfrbi, 
Dmal Ecil NmeRI, Paellkl Pvm84II 


Rsal 


GT^AC 


Afal HpyBl PlaAll 


Scil 


CTCXjAG 


- 


Smal 


CCCXjGG 


CfrJ4l, PaeBl PspALl 


Sr/l 


GCCC^GGGC 




Sspl 


AAT^ATT 




Stid 


AGG^CT 


Aatl AspMl EcoMll Gdil PmeSSl, Sari, Sru3Qm, SseBl, 
Stel 


Type II restriction enzymes cleaving outside their recognition sequences: 


Bsrl 


ACTG_GN^ 


BseXl, BseHl, BsrSl, Bstl 11, Tspll 


Bst¥5l 


GGATG_NN^ 


BseGl 


Btsl 


GCAGTG_NN^ 


Type II restriction enzymes producing 5* overhangs: 


Accl 


GT^MK_AC 


FbR,Xmil 


Acil 


c^cg„c 




Acyl 


grx;g_yc 


Ahall, Aosll, AstWl, Astilll, Bbill, BsaHL, BstACl Hgil, 
HgfDl, HgiGl, HgiHll, Hinll, Hsp92l, Msplll, Pamll 


Ajim 


A^RYG_T 




Agel 


A^CGG_T 


AsiAl, BshTl, CspAl, PinAl 


ApdLl 


G^TGCA_C 


AlwAAl, Snol, Vnel 


Ascl 


GG^GCG_CC 




Aspim 


G^TAC_C 


Acc65l,AhaB%\Sthl 


Asul 


G^NC_C 


AspS9l, Avcl, Bac36l, Batlim, BavAll, BavBll, Bcelll, 
BshKl, BsiYl, 18941, BspBll, Bsp¥4l, Bsu54l, Ccul, 
CfrlSl, MaeKSlll, NsplY, Nsplllll, Pdelll, PspFl, Sau96l 


Asull 


ttx:g_aa 


Acpl, ^^plOHI, Biml, Bim\9l, Bpul4l, BsiCl, Bspll9l, 
BspLAll, BspTlQ4l, BstBl, Cbil, Csp45l, CspSSKll, Fspll, 
Lspl, Mai, NspW, Plall, PpaAl, S/ul. Sspll, SspRFl, Svil 


Aval 


C^YCGR_G 


AmaSll, Aqul, Bcol, Bsel5l, BsoBl, BstSl, EcoSSl, EcollU, 
Nsplll NspSAl PlaAl PunAl 


Avail 


G^WC_C 


Afll, Aspl45l, BaniNxL, BcuAl, BmelU, Bme2l6l, BsrAl, 
BthAl, Caul, Csp6SKl, DsalY, EagMl, Eco47l, Erpl, Fdil, 
FspMSl, Fssl, HgiBl, HgiCll, HgiEl, Hgimil, HgiJl, Kzo49l, 
SinlSmuEl, KpaKllBI, 


Avrll 


C^TAG_G 


AvrBll, BM, Bsp All, XmaJl 


Bamm 


G^ATC_C 


AccEBl, Alii, ApaCl, Asil BcelSll, Bnal, Bsp9Sl Bsp4009l, 
BspAAlll, Bstl, Cell, Mu23l, Nsp29l32ll, NspSAlV, OkrAl, 
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1 Enzymes 


1 Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 




PJISl, RspLKll, Son, Surl 


BbvCl 


j CC^TCA_GC 


Abel 


1 Betl 


1 W^CGG W 


Bcalll, BsaWl 


1 BpulOl 


1 CC^TNA__GC 


BpuDl 


j BseFl 


1 G^GCG_C 


BssHll, BstBZl53lj Paul 


Bsil 


rc^ACGA_G 


BssSl, BstlBl 


BspllOl 


G^GGCC C 


PspOMl 


RvrtWAJJ 




Acclll, AorUm, BbvAlll, BseAl, BsMl, BspUl, BspEl, 
Bsu23l CauB3l, Kpnll, Mrol, PinBll, Ptal 


BstNl 


CC^W GG 


1 Aeul, Aorl, ApaOBl, Apyl, BciBll, BselSl, Bselll, Bse24l, 
BseBl, BshGl, BsiLl, Bspm, Bstll, Bstll, Bst3Sl BstlOQl, 
BstUei BstOl, BsaUl BthDl, BthEl, Cbrl, Cthll, Fspl604l, 
Mval, Snil Sthl 111, TaqXl, Zanl 


BstEll 


GXjTNAC c 


Acrll, AspAl, Bse64l, BseT9l, BseTlOl, BsiKl, BstFl, BstT9l 
BstTlOLEcal Ecil25l Eco9U Ecri06^l FrnC)]9Rl 
I iV:y;7SAII, PspBl 


Caull 




Ahal, Asell AsuCll, Bcnl, HgiS22l, M^/I448 11, Neil 


Cfrl 


Y'^GGCC_R 


Bfmi, Eael, EcoHK3ll 


CfrlOl 


1 R^CGG_Y 


Bco\ 181, Bsel 181, Bse634l, BsrFl, BssAl, BstBlSl 


CviAll 


1 C^AT_G 




Cv/QI 


G^TA_C 


Csp6l, Cv/RII j 


Ddel 


C^TNA_G 


BstBEl, BstJZ30ll 


Drall 


RG^GNC_CY 


EcoOl09l 


Dsal 


C'x:ryg_g 


BstDSl, Btgl, TspBl 


EcoYil 


x:csGG_ 


EcoimiKpn49kll 




\^\^ VV VJvJ 


PspGl, Siel, SspAl 






Mrom, NgoAlY, NgoMlV, Scelll, Srll 


Esp\ 




Blpl, Bpul 1021, Bspll20l, CeRl 


Fnu^m 


GC^N_GC 


BsoEl, BspSl, Fbrl, FspAHl, Ital, Satl, Uur960l 


Gdill 


C^GCC_R 






G'^YRC_C 


AccBll, Banl, BbvBl, BshNl, BspTlOll, Eco64l, HgiUl, 
MspB4l, PfaAl 


HinfL 


G^ANT_C 


Cv/BI, F«wAI, Hhall, SscLll 


HmPll 1 


G^G_C 


Hinei, Hsol, HspAl, 5c/NI 


Hpall j 


C^CG_G 


Bco27l, BsiSl, Bst40l, BsuFl, Cbol, Hapll, Hin2l, Mnol, 
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Enzymes 


Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 






Mspl, PdeUll, Sthl34l 


HpyllSm 


TC^NN_GA 


Hpyimil 


Kasl 


GXjCGC_C 




Mael 


C^TA_G 


Bfal, Fgol, MthZl, Rmal, Xspl 


Maell 


A^G_T 


HpyCmiY 


Maelll 


^TNAC_ 


- 


Mbol 


^ATC_ 


AspUDl, Bce243l, BfiSll, Bmelll, BscFl, Bsp67l, BsplOSl, 
BspU3l, Bsp2095l, BspAl, BspFl Bspll, BstENll, Btkll, Cad, 
Ccyl, Cpfi, Cv/AI, Dpnll, FnuCl, FnuEl, Had, Kzo9l, LlaAl, 

SauMl,Sth36^l 


Mlul 


A^GCG_T 


BbaAl 


Narl 


GG^G_CC 


Mch\ Mlyl 131, Ndal, Nunll, SseAl 


Ncol 


C^ATG^G 


Bspm 


Nhel 


G^TAG_C 




Notl 


GC^GCC_GC 


Cc/NI, CjpBI, MchAl 


PpuMl 


RG^WC_CY 


pjaii, Ppuxi, Pspsii, Pspvm 


Rsrll 


CG^WC_CG 


Cpol, Cspl, Rsr2l 


Sail 


G^TCGA_C 


BspMKl, HgiCm, HgiDll Nopl, RflFl Rtrl, Rtr63l, Xcil 


SanDl 


GG^WC_CC 


Sseinsi 


Said 


CC^TNA_GG 


Lmu6Ql, Mstll, Oxam, SshAl 


ScrFl 


CC^N_GG 


Bmel390l Msp61l, Msp?3l 


Seel 


C^NNG_G 


BsaJl, BseDl, BssECl 


Sell 


^GCG_ 


- 


SexAl 


A^CWGG_T 


Mabl 


Sfel 


C^TRYA_G 


BdiSl Bfinl, BstSFl LlaBl, Sfd 


SgrAl 


CR^CGG_YG 




Siml 


GG^TC__ 


~ 


Smll 


C^TYRA_G 




Spn 


C^TAC_G 


BpuBSl, BsiWl, BvuBl MaeKSlI, PJI2311, PpuAl PspLl, 
Sunl 


Sse232l 


CG^CGG_CG 




Ssoll 


^CNGG_ 


BssKl, BstUZeUl, DsaV, EcIlSkl, EcoUkl, Eco2lkI, \ 
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Enzymes 


Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 






EcoUiyi, Kpnlkl, StyD4l 


Styl 


C^WWG„G 


BsmSl, BssTll, CfrBl, EcoUOl, EcoTUl, Erhl, ErhB9ll 


Tagl 


T^G_A 


PpaMl, 75/7321, Tsp32ll, T/AHBSI 


Tsel 


G^WG_C 


Acel, Taq52l 


TspA5\ 


^TSAC_ 


HpySllNmuCl 


Unbl 


^GNCC_ 


- 


VpaK.n Al 


^GGWCC„ 


- 


Xhol 


C^TCGA G 


Abrl, Blul, BspAAl, BssUl, BsNl, Ccrl, Mavl, MlaAl, 
Soim79lStrl,Tia,Xpal 


Xholl 


R^ATC_Y 


Bhm, BstXll, BstYl, Dsalll MJR, Psul, TrulQU 


Xmal 


C^CCGG_G 


Akyl Cfr9l, EaeAl, EciRI, Pac25l, PspAl, Xcyl, XmaCl 


Xmalll 


C^GGCC_G 


Aaal, BseXSl, BstZl, Eagl, EcIXl, Eco52l 5€«PT16I 


Type II restriction enzymes producing 3* overhangs: 


Aatll 


G_ACGT^ 


Ssp5230l 


Acell 


G_CTAG^C 




Apal 


G_GGCC^ 


Ppel 


Bbel 


G_GCGC^ 




BseSl 


G_KGCM^ 




BspKTSl 


G_AT'X 




Bsrl 


ACTG_GN^ 


Bsell, Bsem, BsrSl, Bstl 11, Tspll 


BstFSl 


GGATG_NN^ 


BseGl 


BstXl 


CCAN_NNNN^NTGG 


BstHZ55l 


Chal 


_GATC^ 




Fmul 


G__GNC^ 




Fsel 


GG_CCGGX:C 




Haell 


R_GCGC^Y 


AccB2l, Bspl43ll, Bstmi 


HgiAl 


G_WGCW^ 


Alw2ll Aspm, BbvUl, Bsh45l, BsiFSKAl, MspW2m 


HgUU 


g^rgcy-n: 


Banll, Bpul, Bsp5l91, &m 18541, Bvul, Eco2Al, EcolSKl, 
&£?T38I, Fr/OI, Koxll, PaeBl, SacNl 


Hhal 


G_CG^ 


AspLEl, BsphAl, BstmU, Cfol, F«wDIII 


Hpy99l 


_CGWCG^ 
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Enzymes 


j Recognition Sequence 


Other Enzymes Capable of Cutting These Sequences 


HpylSSl 


TC_N^A 




HpyCRAl 


C_AT^G 




Kpnl 


G_GTAC^ 


- 


Mcrl 


j CG_RY^G 


BsaOl, BshUSSl, BsfEl, BstMCl 


Nlalll 


j _CATG^ 


HmlU,Hsp92U 


Niacin 


C_YCGRXj 




Nspl 


R_CATG^Y 


BsmSl, NspUl PauAl, PunAll, Xcel 


Psp03l 


G_GWC^ 


- 


Pssl 


RG_GNC'^Y 




Pstl 


C_TGCAXj 


Ajol AliAJl, Apil, AsplUl, Bsp63l, BspBl, BsuBl, CJIl, 
CfrA4l, Cfull, Cstl, EclSm, EcilzL, Halil, MhaAl, PaePI, 
P/721I, Psp23l, Sam, Sfll, Yenl 


Pviil 


CG_AT^G 


AfallMl, Afa\6Rl, BspCl, EagBl, ErhB9l, Mvrl, Nba, Piel9l\ 
Psul6llRshlXorll j 


Sad 


g_agctx: 


Pspl24Bl, Sstl 


Sacll 


cc gc^gg 


CfrA2l, Cscl, Eae46l, £co29kI, Gall, Gcel, GceGLl, Kpn37SU 

fCs:n\ Ncrn AJIJ NcmPJU Pnf>\l Pn^CW Pn£>^\c\ Prf£f'\A\i'^ i 
-tv-j/^Aj i L/>ri.iiA, jv^c/r^iii, /^«c;r\i, /^tteV^l, rCiSDiSJ., ru&LHKL, a 

SchZl, SenFTUhl, SexBl, SexCl, Sfr303l SgrBl, Spul, Sstll 


1 Sdiil 


g_dgch^c 


Aocll Bmyl, BsoCl, BspllSSl BspLS2l, Mhll, Nspll 


1 Sgfl 


gcg_at^cgc 


AsiSl 


1 Sphl 


g_catg^ 


Bbul, Pael, PfaAlll, RspLKl, SpaUl 


Sse%3%ll 


CC_TGCA^GG 


Sbfi, Sdal 


Tail 


_ACGT^ 


Tscl, TspA9l 


Taid 


G_CSGX: 




TspACl 


AC_N^T 


BstACl HpyCU4m, Taal 



In the preferred embodiment shown in Figure 1, an aminopropanol moiety is at the 3' end 
of the PRL. This group can be used to covaiently attach DNA oligonucleotides to glass surfaces 
and form arrays [23]. A biotin moiety is bound to the 5' end of the PRL (Figure 1). Biotin and 

streptavidin have a strong affinity for each other and are routinely used as stable linking agentsi 

j! 

Bioluminescent enzymes can be conjugated to streptavidin. When streptavidin-enzyme conjugate 
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and biotin-labeled PRLs are incubated together, the bioluminescent enzyme becomes attached to 
the PRLs. Polythymine nucleotide spacers are located at the ends of the PRL (Figure 1). The 
number of thymine nucleotides in these spacers can be adjusted to decrease steric hindrances of 
restriction enzyme activity. Other organic moieties^ such as alkane chains, could be used as. 
spacers. • 
This embodiment demonstrates the utility of this invention for determining the presence 
or absence of specific RNA or DNA sequences (Figure 2). The invention will result in a product 
that could be supplied as an array of bound, bioluminescently-labeled PRLs. Each PRL would 
have a stem structure where two complementary regions of the same molecule hybridize. The 
array will have the streptavidin-bioluminescent enzyme conjugate attached to the PRLs. DNA 
and/or RNA will be purified by a clinician according to standard methods, which are available in 
several nucleic acid purification kits. The purified nucleic acids may be sheared into smaller 
pieces suitable for hybridization and incubated in a buffered solution with the PRL array for 
hybridization to occur. The temperature and conditions of the hybridization step will be' 
determined by the stability of the possible individual hybrids in the PRL array. A temperature 
that reduces non-specific binding of target sequences that are not completely complementary to 
the array loop sequences can be determined for each array of PRLs. After hybridization occurs, 
the array will be washed and restriction enzyme buffer and restriction enzyme will be added. 
During incubation with the restriction enzyme solution, the PRLs that are in a hairpin 
configuration will be cut inside the stem sequence. This cleavage will leave only a few hydrogen 
bonds holding the label to the surface. The bonds will be unstable and break at room 
temperature, releasing the label from the bound end of PRL. The array will then be washed again 
to remove the mbound labels and restriction enzyme. The remaining bound bioluminescent 
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enzyme labels will be detected by incubating the array with the appropriate substrate solution 
and measuring photon production. Photon production at a probe site signifies that the sequence 
within the loop portion of the probe is complementary to a sequence in the target DNA or RNA. 
Lack of photon production at a PRL sight indicates the absence of a sequence in the target DNA 
or RNA that is complementary to the probe's loop sequence. 

The simplicity and speed of this assay should allow routine use in areas where a parallel 
genetic test would be advantageous but is often too difficult or time-consuming. Examples of 
uses include genetic screening for human mherited diseases, determining identity or parentage, 
identifying pathogens and their antibiotic resistance capabilities, and screening foods or materials 
for contaminating organisms and viruses. 

Other embodiments would include other labels, such as radioisotopic, isotopic, 
enzymatic, chromogenic, fluorescent, chemiluminescent, or primary and secondary antibodies 
that are either attached to the PRL during synthesis or added in a later step (before or after 
hybridization or restriction digestion have occurred). Many new methods for labeling, such as 
multiple labels, fluorescent beads, and microparticles, are currently being developed and may be 
used with PRLs in the fixture [24], 

The PRLs can be attached to glass or silicon surfaces according to several other methods 
[25]. PRLs can also be attached to gold or other metal surfaces via sulfliydryl moieties and to 
microtiter wells via a biotin-streptavidin linkage [26]. 

Labels and immobilization moieties can be located anywhere along the length of the 
PRL, including the loop sequence region, providing that their location does not interfere with 
hybridization and that the restriction site lies between the surface-attachment site and the 
position of the label. 
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The PRL may have different types of spacers or multiple spacers for the purpose of 
raising the PRL above the surface; to reduce steric hindrances during the hybridization, washing, 
restriction enzyme digestion; etc. In addition, oligonucleotides can be constructed for use as 
controls that always remain or are always removed from the surface during the assay. 
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